SF-CNN: Deep Text Classification and Retrieval for Text Documents

نویسندگان

چکیده

Researchers and scientists need rapid access to text documents such as research papers, source code dissertations. Many are available on the Internet more time retrieve exact based keywords. An efficient classification algorithm for retrieving keyword words is required. The traditional performs less because it never considers words’ polysemy relationship between bag-of-words in To solve above problem, Semantic Featured Convolution Neural Networks (SF-CNN) proposed obtain key relationships among searching keywords build a structure matching correct documents. SF-CNN deep semantic-based bag-of-word representation document retrieval. Traditional learning methods Convolutional Network Recurrent use semantic bag-of-words. experiment performed with different datasets evaluating performance of method. classifies an accuracy 94% than algorithms.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

DOC: Deep Open Classification of Text Documents

Traditional supervised learning makes the closed-world assumption that the classes appeared in the test data must have appeared in training. This also applies to text learning or text classification. As learning is used increasingly in dynamic open environments where some new/test documents may not belong to any of the training classes, identifying these novel documents during classification pr...

متن کامل

A New Approach for Text Documents Classification with Invasive Weed Optimization and Naive Bayes Classifier

With the fast increase of the documents, using Text Document Classification (TDC) methods has become a crucial matter. This paper presented a hybrid model of Invasive Weed Optimization (IWO) and Naive Bayes (NB) classifier (IWO-NB) for Feature Selection (FS) in order to reduce the big size of features space in TDC. TDC includes different actions such as text processing, feature extraction, form...

متن کامل

Text Classification of Formatted Text Documents

We describe a multiclass text classification system for formatted text messages contained in the Rich Text Format fields of a structured database of military documents. This system uses a Part-Of–Speech tagger and a RuleBased Classifier to classify 80 different types of formatted messages.

متن کامل

Classification of text documents

The exponential growth of the internet has led to a great deal of interest in developing useful and efficient tools and software to assist users in searching the Web. Document retrieval, categorization, routing and filtering can all be formulated as classification problems. However, the complexity of natural languages and the extremely high dimensionality of the feature space of documents have ...

متن کامل

An Improved K-Nearest Neighbor with Crow Search Algorithm for Feature Selection in Text Documents Classification

The Internet provides easy access to a kind of library resources. However, classification of documents from a large amount of data is still an issue and demands time and energy to find certain documents. Classification of similar documents in specific classes of data can reduce the time for searching the required data, particularly text documents. This is further facilitated by using Artificial...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Intelligent Automation and Soft Computing

سال: 2023

ISSN: ['2326-005X', '1079-8587']

DOI: https://doi.org/10.32604/iasc.2023.027429